Clustering mass spectrometry data using order statistics.
نویسندگان
چکیده
Mass spectrometry data is inherently uncertain. Rather than compare peak heights across samples, a comparison can be made of the relative ordering of the peak height across samples. Order statistics are used to provide a distance metric between each ordered list of peak heights from the samples. A principal component analysis is performed on the set of distance vectors to highlight to important components.
منابع مشابه
Normal-Gamma-Bernoulli peak detection for analysis of comprehensive two-dimensional gas chromatography mass spectrometry data
Compared to other analytical platforms, comprehensive two-dimensional gas chromatography coupled with mass spectrometry (GC×GC-MS) has much increased separation power for analysis of complex samples and thus is increasingly used in metabolomics for biomarker discovery. However, accurate peak detection remains a bottleneck for wide applications of GC×GC-MS. Therefore, the normal-exponential-Bern...
متن کاملTesting for Multivariate Normality in Mass Spectrometry Imaging Data: A Robust Statistical Approach for Clustering Evaluation and the Generation of Synthetic Mass Spectrometry Imaging Data Sets.
Spatial clustering is a powerful tool in mass spectrometry imaging (MSI) and has been demonstrated to be capable of differentiating tumor types, visualizing intratumor heterogeneity, and segmenting anatomical structures. Several clustering methods have been applied to mass spectrometry imaging data, but a principled comparison and evaluation of different clustering techniques presents a signifi...
متن کاملThe effect of H2SO4 – amine clustering on chemical ionization mass spectrometry (CIMS) measurements of gas-phase sulfuric acid
The state-of-the art method for measuring atmospheric gas-phase sulfuric acid is chemical ionization mass spectrometry (CIMS) based on nitrate reagent ions. We have assessed the possible effect of the sulfuric acid molecules clustering with base molecules on CIMS measurements using computational chemistry. From the computational data, three conclusions can be drawn. First, a significant fractio...
متن کاملEffective peak alignment for mass spectrometry data analysis using two-phase clustering approach
In recent years, mass spectrometry data analysis has become an important protein identification technique. The mass spectrometry technologies emerge as useful tools for biomarker discovery through studying protein profiles in various biological specimens. In mining mass spectrometry datasets, peak alignment is a critical issue among the preprocessing steps that affect the quality of analysis re...
متن کاملBi-clustering of metabolic data using matrix factorization tools.
Metabolic phenotyping technologies based on Nuclear Magnetic Spectroscopy (NMR) and Mass Spectrometry (MS) generate vast amounts of unrefined data from biological samples. Clustering strategies are frequently employed to provide insight into patterns of relationships between samples and metabolites. Here, we propose the use of a non-negative matrix factorization driven bi-clustering strategy fo...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- Proteomics
دوره 3 9 شماره
صفحات -
تاریخ انتشار 2003